Generalizing from Example Clusters
نویسندگان
چکیده
We consider the following problem: Given a set of data and one or more examples of clusters, find a clustering of the whole data set that is consistent with the given clusters. This is essentially a semi-supervised clustering problem, but it differs from previously studied semi-supervised clustering settings in significant ways. Earlier work has shown that none of the existing methods for semi-supervised clustering handle this problem well. We identify two reasons for this, which are related to the default metric learning methods not working well in this situation, and to overfitting behavior. We investigate the latter in more detail and propose a new method that explicitly guards against overfitting. Experimental results confirm that the new method generalizes much better. Several other problems identified here remain open.
منابع مشابه
Using data envelopment analysis (DEA) to improve the sales performance in Iranian agricultural clusters by utilizing business networks and business development services providers (BDSPs)
Business clusters play an important role in developing and improving the economic performance of countries and in promoting the welfare of people. Business development service providers (hereafter referred to as, BDSP) have a considerable role in providing specialized services pertinent to the conditions of active enterprises in clusters and in promoting their performance level in order to impr...
متن کاملInterspecies interactions of halophilic and halotolerant actinomycetes: An example from a salt
Interspecies interaction of actinomycetes will express new gene clusters and may therefore affect the pigmentation, sporulation and production of secondary metabolites. Actinomycetes strains were isolated from Howze Soltan Salt Lake. Binary actinomycete interaction assay was conducted to evaluate its effect on colony morphology and antibiotic production. The molecular identification of the indu...
متن کاملGeometric and Electronic Structures of Vanadium Sub-nano Clusters, Vn (n = 2-5), and their Adsorption Complexes with CO and O2 Ligands: A DFT-NBO Study
In this study, electronic structures of ground state of pure vanadium sub-nano clusters, Vn (n=2-5), and their interactions with small ligands for example CO and triplet O2 molecules are investigated by using density functional theory (DFT) calibration at the mPWPW91/QZVP level of theory. The favorable orientations of these ligands in interaction with pure vanadium sub-nano clusters were determ...
متن کاملCluster-Lift Method for Mapping Research Activities over a Concept Tree
The paper builds on the idea by R. Michalski of inferential concept interpretation for knowledge transmutation within a knowledge structure taken here to be a concept tree. We present a method for representing research activities within a research organization by doubly generalizing them. To be specific, we concentrate on the Computer Sciences area represented by the ACM Computing Classificatio...
متن کاملP75: A Study of Perfectionism, Anxiety Sensitivity and Sleep Disturbance in the Generalizing Anxiety Disorder and Normal People
Perfectionism, anxiety sensitivity and sleep disturbance are among the main causes of generalizing anxiety disorder. This study aims to compare perfectionism, anxiety sensitivity and sleep disturbance between patients with generalizing anxiety disorder (GAD) and control group. The present study was a cross-sectional and ex-post facto investigation (causal comparative method). Statistical univer...
متن کامل